A new object of probability theory, the two-sided chain of events (symbols), is introduced. A theory of multi-step Markov chains with long-range memory, proposed earlier in Phys. Rev. E 68, 06117 (2003), is developed and used to establish the correspondence between these chains and two-sided ones. Every Markov chain is proved to be statistically equivalent to a definite two-sided one, and vice versa. The results obtained for binary chains are generalized to chains with an arbitrary number of states.
I. INTRODUCTION

The problem of long-range correlated random symbolic systems (LRCS) has been under study for a long time in many areas of contemporary physics [1, 2, 3, 4, 5, 6], biology [7, 8, 9, 10, 11, 12], economics [8, 13, 14], linguistics [15, 16, 17, 18, 19], etc.

Among the ways to gain correct insight into the nature of correlations in complex dynamic systems, the use of multi-step Markov chains is one of the most important, because they make it possible to construct a random sequence with the required correlation properties in the most natural way. This was demonstrated in Ref. [20], where the concept of a Markov chain with a step-wise memory function, which consists in the coordinate independence of the conditional probability, was introduced. The concept of additive chains turned out to be very useful because the binary correlation function of the chain can be evaluated through the memory function (for details see Refs. [21, 22]). The correlation properties of some dynamic systems (coarse-grained sequences of Eukarya's DNA and dictionaries) can be well described by this model [23].

Another important reason for the study of Markov chains is their application to various physical objects [21, 22, 23], e.g., to Ising chains of spins. The problem of the thermodynamic description of Ising chains with long-range spin interaction remains open even for the 1D case. However, the association of such systems with Markov chains can shed light on the non-extensive thermodynamics of the LRCS.

Multi-step Markov chains are characterized by the probability that each symbol of the sequence takes on a definite value under the condition that some previous symbols are fixed. Such chains can easily be constructed by sequential generation using a prescribed conditional probability function. Besides, the statistical properties of Markov chains can be determined in some simple cases. At the same time, there is another class of correlated sequences, the so-called two-sided chains. They are determined by the probability that each symbol of the sequence takes on a definite value under the condition that some symbols on both sides of the chosen symbol are fixed. An example of a system with this property is the above-mentioned Ising chain. Unfortunately, the approach used for finding the properties of Markov chains (the probability of occurrence of a given "word", the correlation functions, and so on) cannot be applied directly in this case. In this paper we prove that such mathematical objects, defined in Sec. II A as two-sided chains, are Markov chains. So, the statistical properties of Markov chains and the method of their construction can be used for studying two-sided chains.

The paper is organized as follows. In the first Section we give the definitions of the Markov and two-sided chains. The next Section is devoted to the proof of the main statement: the first Subsection contains the proof of the direct statement that every binary Markov chain is at the same time a binary two-sided one; the second Subsection shows that the classes of these two chains coincide. Finally, in the last Subsection we generalize these results to the case of non-binary chains.
II. BASIC NOTIONS



A. General definitions 



Let us define the $N$-step Markov chain. This is a sequence of random variables $a_i$, $i = -M, -M+1, \ldots, M$ ($M \gg N$), referred to as the symbols, which have the following property: the probability for a symbol $a_i$ to have a certain value, under the condition that the values of all previous symbols are fixed, depends on the values of the $N$ previous symbols only,

$$P(a_i = a \mid \ldots, a_{i-2}, a_{i-1}) = P(a_i = a \mid a_{i-N}, \ldots, a_{i-2}, a_{i-1}). \tag{1}$$

A chain defined in this way is stationary, because the conditional probability does not depend explicitly on $i$, i.e., it does not depend on the position of the symbols $a_{i-N}, \ldots, a_{i-1}, a_i$ in the chain and depends only on the values of the symbols and their positional relationship.

The chain under consideration is defined for an arbitrary but finite length $M$. Nevertheless, none of the results obtained below depend on $M$. Thus, they remain correct in the limit of infinite length, provided that the conditional probability function is fixed.

In various mathematical and physical problems we encounter sequences for which the probability for a symbol $a_i$ to have a certain value, under the condition that the values of all the remaining symbols are fixed, depends on the values of the $N$ previous and the $N$ next symbols only,

$$P(a_i = a \mid \ldots, a_{i-2}, a_{i-1}, a_{i+1}, a_{i+2}, \ldots) = P(a_i = a \mid a_{i-N}, \ldots, a_{i-1}, a_{i+1}, \ldots, a_{i+N}). \tag{2}$$

Let us call these chains $N$-two-sided ones. For the same reason as above, see Eq. (1), such a chain is stationary.

An important class of random sequences is the binary chains. If each symbol of the chain can take on only two values, $s_0$ and $s_1$, we refer to the chain as binary. It is convenient to change these values to 0 and 1 using the linear transformation

$$a_i := \frac{a_i - s_0}{s_1 - s_0}.$$

Now we describe the ways of constructing the chains defined above.



B. Constructing of the chains 

A Markov chain defined in this way is simple to simulate numerically. There are two basic approaches. In both of them each next symbol is generated successively from the $N$ previous ones. The approaches differ in the method of constructing the first $N$-word, i.e., the set of $N$ sequential symbols.

The first approach requires some additional conditional probabilities to be calculated. They can be found from the compatibility equation for the conditional probabilities:

$$P(a_i = a \mid a_{i-k}, \ldots, a_{i-1}) = \frac{\sum_{a_{i-N}} \cdots \sum_{a_{i-k-1}} P(a_i = a \mid a_{i-N}, \ldots, a_{i-1})\, P(a_{i-N}, \ldots, a_{i-1})}{\sum_{a_{i-N}} \cdots \sum_{a_{i-k-1}} P(a_{i-N}, \ldots, a_{i-1})}. \tag{3}$$

Here $k = 0, \ldots, N-1$ and the sign $\sum_{a_j}$ means summation (or integration) over all possible values of the symbol $a_j$. The probabilities of occurrence of $N$-words, $P(a_{i-N}, \ldots, a_{i-1})$, should be obtained from the following linear system,

$$P(a_1, a_2, \ldots, a_N) = \sum_{a_0} P(a_N \mid a_0, a_1, \ldots, a_{N-1})\, P(a_0, a_1, \ldots, a_{N-1}), \qquad \sum_{a_1} \cdots \sum_{a_N} P(a_1, a_2, \ldots, a_N) = 1. \tag{4}$$
Using Eq. (3) we can construct the first $N$ symbols by generating them sequentially in accordance with the following conditional probabilities:

$$P(a_1),\ P(a_2 \mid a_1),\ P(a_3 \mid a_1, a_2),\ \ldots,\ P(a_N \mid a_1, \ldots, a_{N-1}).$$
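The linear system for the $N$-word probabilities can be solved numerically in several ways. The sketch below is a minimal illustration for a binary chain, not the authors' procedure: it applies the stationarity condition repeatedly (power iteration) until the word probabilities stop changing. The function names, the iteration count and the example conditional probability `cond_one` are assumptions introduced here for illustration only.

```python
from itertools import product


def stationary_word_probs(cond_one, N, iterations=2000):
    """Approximate P(a_1, ..., a_N) for a binary N-step Markov chain.

    cond_one(word) must return P(a_i = 1 | word), where word is the tuple
    of the N preceding symbols, oldest first.
    """
    words = list(product((0, 1), repeat=N))
    probs = {w: 1.0 / len(words) for w in words}  # normalised starting guess
    for _ in range(iterations):
        new = {w: 0.0 for w in words}
        for w, p in probs.items():
            p1 = cond_one(w)
            # Shift the window by one symbol: (a_0..a_{N-1}) -> (a_1..a_N).
            new[w[1:] + (1,)] += p1 * p
            new[w[1:] + (0,)] += (1.0 - p1) * p
        probs = new
    return probs


# Hypothetical 2-step conditional probability, chosen only as an example.
def cond_one(word):
    return 0.5 + 0.2 * (word[-1] - 0.5) + 0.1 * (word[-2] - 0.5)


probs = stationary_word_probs(cond_one, N=2)
```

Each iteration preserves the normalization, so only the first equation of the system has to be enforced by the iteration itself. For large $N$ the number of words grows as $2^N$, and a direct linear solver or a sparse representation may be preferable.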

The second approach is based on a random choice of the first $N$-word. It is simpler than the first one because it does not require the calculation of additional probabilities. However, unlike the first method, it does not give a stationary chain from the very beginning. To generate the same chain using the second approach, we must construct as many extra symbols as are needed for the chain to become stationary and then remove this initial part of the chain.
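The second approach can be sketched in a few lines. The code below is an illustrative sketch for a binary chain under stated assumptions: the burn-in length of 1000 symbols is an arbitrary choice (the text does not specify how many symbols must be discarded), and `generate_chain` and `cond_one` are hypothetical names introduced here.

```python
import random


def generate_chain(cond_one, N, length, burn_in=1000, seed=None):
    """Generate a binary N-step Markov chain from a random initial N-word.

    cond_one(word) must return P(a_i = 1 | word) for the tuple of the
    N preceding symbols, oldest first.
    """
    rng = random.Random(seed)
    # Random choice of the first N-word.
    chain = [rng.randint(0, 1) for _ in range(N)]
    for _ in range(burn_in + length):
        p1 = cond_one(tuple(chain[-N:]))
        chain.append(1 if rng.random() < p1 else 0)
    # Remove the random initial word together with the burn-in part.
    return chain[N + burn_in:]


# Hypothetical 2-step conditional probability, chosen only as an example.
def cond_one(word):
    return 0.5 + 0.2 * (word[-1] - 0.5) + 0.1 * (word[-2] - 0.5)


chain = generate_chain(cond_one, N=2, length=5000, seed=0)
```

The cost is one evaluation of the conditional probability per generated symbol, which is the property that makes Markov chains convenient for simulation.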

There is no simple method for constructing two-sided chains. The best known and simplest approach is the Metropolis algorithm, but it needs many more operations than the construction of a Markov chain. Therefore it is very important to prove the equivalence of the Markov and two-sided chains.
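To make the difference in cost concrete, the sketch below samples a binary two-sided chain by single-site heat-bath (Gibbs) updates, a close relative of the Metropolis algorithm that is used here only for brevity. It is an illustration under assumptions: periodic boundary conditions, a fixed number of sweeps, and the names `heat_bath_sweeps` and `two_sided_one` are all choices made for this example and do not come from the text.

```python
import random


def heat_bath_sweeps(two_sided_one, N, length, sweeps=100, seed=None):
    """Sample a binary N-two-sided chain by repeated single-site updates.

    two_sided_one(left, right) must return P(a_i = 1 | left, right), where
    left and right are the N-words before and after the symbol a_i.
    """
    rng = random.Random(seed)
    chain = [rng.randint(0, 1) for _ in range(length)]
    for _ in range(sweeps):
        for i in range(length):
            # Periodic boundaries (an assumption of this sketch).
            left = tuple(chain[(i + d) % length] for d in range(-N, 0))
            right = tuple(chain[(i + d) % length] for d in range(1, N + 1))
            chain[i] = 1 if rng.random() < two_sided_one(left, right) else 0
    return chain


# Hypothetical rule for N = 1: a symbol tends to copy its neighbours.
def two_sided_one(left, right):
    return 0.2 + 0.3 * (left[-1] + right[0])


sample = heat_bath_sweeps(two_sided_one, N=1, length=50, sweeps=20, seed=1)
```

Every symbol has to be revisited in each sweep, and the number of sweeps needed for equilibration is not known in advance, whereas a Markov chain is generated in a single pass.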

III. EQUIVALENCE OF THE MARKOV AND TWO-SIDED CHAINS 

In this section we prove the equivalence of two random sequences, the Markov and two-sided chains. The proof is carried out for a binary chain, but it can be directly generalized to arbitrary chains (see Subsection III C for details). The proof requires some formulas for the conditional probability. Its definition is

$$P(A \mid B) = \frac{P(A, B)}{P(B)}. \tag{5}$$

Here and below a comma between the symbols-events means that both events occur simultaneously, i.e., it denotes the product of two events, $(A, B) = A \cap B$. Using the evident equation

$$P(A, B \mid C) = P(A \mid B, C)\, P(B \mid C), \tag{6}$$

the following formula can easily be obtained,

$$P(A \mid B, C) = \frac{P(A, B \mid C)}{P(A, B \mid C) + P(\bar{A}, B \mid C)}, \tag{7}$$

where $\bar{A}$ is the event opposite to $A$.

A. From the Markov to two-sided chain 

Let us demonstrate that a Markov chain is a two-sided one. For this purpose, using Eq. (7), we rewrite the probability for the symbol $a_i$ to be equal to unity, under the condition that the values of all the remaining symbols are fixed, in the following form:

$$P(a_i = 1 \mid A_i^-, A_i^+) = \frac{P(a_i = 1, A_i^+ \mid A_i^-)}{P(a_i = 1, A_i^+ \mid A_i^-) + P(a_i = 0, A_i^+ \mid A_i^-)}, \tag{8}$$

where $A_i^- = (\ldots, a_{i-2}, a_{i-1})$ and $A_i^+ = (a_{i+1}, a_{i+2}, \ldots)$.

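To obtain the value of $P(a_i = 1, A_i^+ \mid A_i^-)$ one needs to apply Eq. (6) repeatedly to express it as a product:

$$P(a_i = 1, A_i^+ \mid A_i^-) = P(a_i = 1 \mid A_i^-)\, P(a_{i+1}, A_{i+1}^+ \mid a_i = 1, A_i^-) = P(a_i = 1 \mid A_i^-)\, P(a_{i+1} \mid a_i = 1, A_i^-)\, P(a_{i+2}, A_{i+2}^+ \mid A_i^-, a_i = 1, a_{i+1}) = \ldots = \prod_{r=i}^{M} P(a_r \mid A_r^-)\Big|_{a_i = 1}. \tag{9}$$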



However, the chain under consideration is an $N$-step Markov one and, according to definition (1), the probability of the symbol $a_r$, under the condition that the values of all previous symbols are fixed, depends on the values of the $N$ previous symbols only. So, the factors of the product with $r > i + N$ in Eq. (9) do not depend on $a_i$ and cancel in the ratio of Eq. (8). Substituting the expression for $P(a_i = 1, A_i^+ \mid A_i^-)$ into Eq. (8), we derive the following equation,

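$$P(a_i = 1 \mid T_{i,N}^-, T_{i,N}^+) = \frac{\prod_{r=0}^{N} P(a_{i+r} \mid T_{i+r,N}^-)\Big|_{a_i = 1}}{\prod_{r=0}^{N} P(a_{i+r} \mid T_{i+r,N}^-)\Big|_{a_i = 1} + \prod_{r=0}^{N} P(a_{i+r} \mid T_{i+r,N}^-)\Big|_{a_i = 0}}. \tag{10}$$

Here $T_{i,N}^- = (a_{i-N}, \ldots, a_{i-1})$ and $T_{i,N}^+ = (a_{i+1}, \ldots, a_{i+N})$ are the previous and next words of length $N$ with respect to the symbol $a_i$.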

Equation (10) is the fundamental relation connecting Markov and two-sided chains. One can see from it that the probability of the symbol $a_i$, under the condition of fixed values of all the remaining symbols, is determined only by the two words of length $N$, $T_{i,N}^-$ and $T_{i,N}^+$. So, according to definition (2), the Markov chain is a two-sided one, quod erat demonstrandum.
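Equation (10) translates directly into a short routine for binary chains. The sketch below is an illustration only: the function name `two_sided_from_markov`, its argument layout and the example one-step chain are assumptions introduced here, not part of the original text.

```python
def two_sided_from_markov(cond_one, left, right):
    """Evaluate P(a_i = 1 | T^-, T^+) from Eq. (10) for a binary chain.

    cond_one(word) returns P(a = 1 | word) for the N preceding symbols,
    left  = (a_{i-N}, ..., a_{i-1}) and right = (a_{i+1}, ..., a_{i+N}).
    """
    N = len(left)

    def weight(center):
        # Window of 2N + 1 symbols with a_i = center in the middle.
        window = tuple(left) + (center,) + tuple(right)
        w = 1.0
        for r in range(N + 1):
            # window[r:r + N] holds the N symbols preceding a_{i+r}.
            p1 = cond_one(window[r:r + N])
            w *= p1 if window[r + N] == 1 else 1.0 - p1
        return w

    w1, w0 = weight(1), weight(0)
    return w1 / (w1 + w0)


# Hypothetical one-step chain: the next symbol repeats the previous one
# with probability 0.8.
def cond_one(word):
    return 0.8 if word[-1] == 1 else 0.2


p_between_ones = two_sided_from_markov(cond_one, (1,), (1,))
```

For this example chain a symbol placed between two unities equals unity with probability $0.64 / (0.64 + 0.04) = 16/17$, which is noticeably larger than the one-sided value 0.8 because the next symbol also carries information about $a_i$.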



B. From two-sided to the Markov chain 



Now we prove the converse statement: the two-sided chain is a Markov one. That is, we prove that the probability for the symbol $a_i$ to be equal to unity, under the condition that all previous symbols are fixed, depends on the values of the $N$ previous symbols only. To this end, let us take two sets of symbols $A'$ and $A''$ which are two variants of the word $A_{i-N}^-$ and differ only in one symbol $a_{i-N-k}$ for an arbitrary value of $k > 0$.

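Using the definition of the conditional probability (5) we obtain

$$P(a_i \mid A', T_{i,N}^-) = \frac{P(A', T_{i,N}^-, a_i)}{P(A', T_{i,N}^-)} = \frac{P(a'_{i-N-k} \mid A, T_{i,N}^-, a_i)\, P(A, T_{i,N}^-, a_i)}{P(a'_{i-N-k} \mid A, T_{i,N}^-)\, P(A, T_{i,N}^-)} = \frac{P(a'_{i-N-k} \mid A, T_{i,N}^-, a_i)}{P(a'_{i-N-k} \mid A, T_{i,N}^-)}\, P(a_i \mid A, T_{i,N}^-),$$

where $A$ is the set of symbols $A'$ (or $A''$) except for the symbol $a_{i-N-k}$. However, according to the definition of the two-sided chain, the conditional probability $P(a'_{i-N-k} \mid A, T_{i,N}^-, a_i)$ does not depend on the symbol $a_i$, since the latter is situated at a distance greater than $N$ from $a_{i-N-k}$. Hence one gets

$$P(a_i \mid A', T_{i,N}^-) = P(a_i \mid A, T_{i,N}^-) = P(a_i \mid A'', T_{i,N}^-).$$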

So, we find that the probability $P(a_i \mid A_i^-)$ takes on the same value for any word $A_{i-N}^-$. We conclude that this probability does not depend on $A_{i-N}^-$. Thus we ascertain that

$$P(a_i \mid A_i^-) = P(a_i \mid T_{i,N}^-).$$

In other words, according to definition (1), the two-sided chain is a Markov one, quod erat demonstrandum.

It should be emphasized that every two-sided chain is equivalent to a single Markov one, though this is not evident because of the non-linear structure of Eq. (10). Using a trivial consequence of Eq. (5),

$$P(a_i \mid T_{i,N}^-) = \frac{P(T_{i,N}^-, a_i)}{P(T_{i,N}^-)}, \tag{11}$$

one can easily make sure that a single chain cannot have two different conditional probabilities. The point is that the probabilities of occurrence of $N$- and $(N+1)$-words uniquely determine the conditional probability according to Eq. (11). Hence, for the chain under study the Markov conditional probability is determined uniquely.
C. The case of non-binary chain

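The results obtained in Secs. III A and III B can be generalized to non-binary chains. We can develop a similar proof and get the following equation connecting the conditional probability functions,

$$P(a_i = a \mid T_{i,N}^-, T_{i,N}^+) = \frac{\prod_{r=0}^{N} P(a_{i+r} \mid T_{i+r,N}^-)\Big|_{a_i = a}}{\sum_{b \in \mathcal{A}} \prod_{r=0}^{N} P(a_{i+r} \mid T_{i+r,N}^-)\Big|_{a_i = b}}, \tag{12}$$

which is the analogue of Eq. (10).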

In this formula we use the following notation:

- if the symbol $a$ takes on a finite set of values $\mathcal{A}$, then $P(a \mid \ldots)$ denotes conditional probabilities;

- if the symbol $a$ takes on a continuous set of values $\mathcal{A}$, then $P(a \mid \ldots)$ denotes the conditional probability density and the sign $\sum_{b \in \mathcal{A}}$ means $\int_{\mathcal{A}} db$.

Thus, the equivalence between the $N$-two-sided and $N$-step Markov chains is proved for non-binary chains as well. We have found an important formula for converting the Markov conditional probability into the two-sided one and vice versa. This method can be used for numerical and analytical calculations of the conditional probabilities.
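For a finite set of values the conversion from the Markov to the two-sided conditional probability can be sketched as follows. This is a minimal illustration under assumptions: the function name `two_sided_from_markov_general`, the callable signature `cond(symbol, word)` and the three-state example chain are hypothetical and serve only to show how the products and the normalizing sum are evaluated.

```python
def two_sided_from_markov_general(cond, alphabet, left, right):
    """Two-sided conditional distribution of a_i over a finite alphabet.

    cond(symbol, word) returns P(a = symbol | word) for the N preceding
    symbols; left and right are the N-words before and after a_i.
    Returns a dict mapping each value b to P(a_i = b | left, right).
    """
    N = len(left)

    def weight(center):
        window = tuple(left) + (center,) + tuple(right)
        w = 1.0
        for r in range(N + 1):
            # Factor P(a_{i+r} | N preceding symbols) with a_i = center.
            w *= cond(window[r + N], window[r:r + N])
        return w

    weights = {b: weight(b) for b in alphabet}
    total = sum(weights.values())  # normalizing sum over all values of a_i
    return {b: w / total for b, w in weights.items()}


# Hypothetical three-state one-step chain: a symbol is repeated with
# probability 0.5, each of the two other values occurs with 0.25.
def cond(symbol, word):
    return 0.5 if symbol == word[-1] else 0.25


dist = two_sided_from_markov_general(cond, (0, 1, 2), (0,), (0,))
```

In this example the symbol between two zeros equals zero with probability $0.25 / 0.375 = 2/3$, and each of the other two values has probability $1/6$. For a continuous set of values the sum would have to be replaced by a numerical integral, which is not attempted here.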



IV. CONCLUSION 



Thus, we have proved that the classes of the "one-sided" Markov chains and the two-sided ones coincide. The obtained relationship between the conditional probabilities (or their densities in the case of a continuous distribution of the values taken on by the elements of the chains) makes it possible to construct numerically a Markov chain possessing the same statistical properties as the initial two-sided one. So, a two-sided sequence can easily be reproduced numerically with conservation of all statistical properties, and not only of the binary correlation function, as was done in Refs. [24, 25].

Besides, Eq. (12) allows the results of analytical studies of Markov chains (see, for example, Ref. [25]) to be used for two-sided sequences. This can be very useful for the study of physical systems. An example is the Ising chain, which is a two-sided one.